智能论文笔记

Deep learning for enhanced free-space optical communications

Manon P. Bart , Nicholas J. Savino , Paras Regmi , Lior Cohen , Haleh Safavi , Harry C. Shaw , Sanjaya Lohani , Thomas A. Searles , Brian T. Kirby , Hwang Lee

分类：机器学习

2022-08-15

大气效应（例如湍流和背景热噪声）抑制了在开关键控自由空间光学通信中使用的相干光的传播。在这里，我们介绍并实验验证了卷积神经网络，以降低后处理中自由空间光学通信的位错误率，而自由空间光学通信的位比基于高级光学器件的现有解决方案明显简单，更便宜。我们的方法由两个神经网络组成，这是第一个确定在热噪声和湍流中存在相干位序列以及第二个解调相干位序列的存在。通过生成连贯的光线，将它们与热灯结合在一起，并通过湍流的水箱将其结合起来，通过生成开关的键入键流，可以通过实验获得我们网络的所有数据，从而获得了模拟的湍流，并将其传递给了最终的光线。高度准确性。我们的卷积神经网络提高了与阈值分类方案相比的检测准确性，并具有与当前解调和误差校正方案集成的能力。

translated by 谷歌翻译

Efficient Semantic Segmentation on Edge Devices

Farshad Safavi , Irfan Ali , Venkatesh Dasari , Guanqun Song , Ting Zhu

分类：计算机视觉 | 机器学习

2022-12-28

Semantic segmentation works on the computer vision algorithm for assigning each pixel of an image into a class. The task of semantic segmentation should be performed with both accuracy and efficiency. Most of the existing deep FCNs yield to heavy computations and these networks are very power hungry, unsuitable for real-time applications on portable devices. This project analyzes current semantic segmentation models to explore the feasibility of applying these models for emergency response during catastrophic events. We compare the performance of real-time semantic segmentation models with non-real-time counterparts constrained by aerial images under oppositional settings. Furthermore, we train several models on the Flood-Net dataset, containing UAV images captured after Hurricane Harvey, and benchmark their execution on special classes such as flooded buildings vs. non-flooded buildings or flooded roads vs. non-flooded roads. In this project, we developed a real-time UNet based model and deployed that network on Jetson AGX Xavier module.

translated by 谷歌翻译

Toward Improved Generalization: Meta Transfer of Self-supervised Knowledge on Graphs

Wenhui Cui , Haleh Akrami , Anand A. Joshi , Richard M. Leahy

分类：机器学习

2022-12-16

Despite the remarkable success achieved by graph convolutional networks for functional brain activity analysis, the heterogeneity of functional patterns and the scarcity of imaging data still pose challenges in many tasks. Transferring knowledge from a source domain with abundant training data to a target domain is effective for improving representation learning on scarce training data. However, traditional transfer learning methods often fail to generalize the pre-trained knowledge to the target task due to domain discrepancy. Self-supervised learning on graphs can increase the generalizability of graph features since self-supervision concentrates on inherent graph properties that are not limited to a particular supervised task. We propose a novel knowledge transfer strategy by integrating meta-learning with self-supervised learning to deal with the heterogeneity and scarcity of fMRI data. Specifically, we perform a self-supervised task on the source domain and apply meta-learning, which strongly improves the generalizability of the model using the bi-level optimization, to transfer the self-supervised knowledge to the target domain. Through experiments on a neurological disorder classification task, we demonstrate that the proposed strategy significantly improves target task performance by increasing the generalizability and transferability of graph-based knowledge.

translated by 谷歌翻译

Speech MOS multi-task learning and rater bias correction

Haleh Akrami , Hannes Gamper

分类：人工智能

2022-12-04

Perceptual speech quality is an important performance metric for teleconferencing applications. The mean opinion score (MOS) is standardized for the perceptual evaluation of speech quality and is obtained by asking listeners to rate the quality of a speech sample. Recently, there has been increasing research interest in developing models for estimating MOS blindly. Here we propose a multi-task framework to include additional labels and data in training to improve the performance of a blind MOS estimation model. Experimental results indicate that the proposed model can be trained to jointly estimate MOS, reverberation time (T60), and clarity (C50) by combining two disjoint data sets in training, one containing only MOS labels and the other containing only T60 and C50 labels. Furthermore, we use a semi-supervised framework to combine two MOS data sets in training, one containing only MOS labels (per ITU-T Recommendation P.808), and the other containing separate scores for speech signal, background noise, and overall quality (per ITU-T Recommendation P.835). Finally, we present preliminary results for addressing individual rater bias in the MOS labels.

translated by 谷歌翻译

Learning From Positive and Unlabeled Data Using Observer-GAN

Omar Zamzam , Haleh Akrami , Richard Leahy

分类：计算机视觉

2022-08-26

从积极和未标记的数据（又称PU学习）中学习的问题已在二进制（即阳性与负面）分类设置中进行了研究，其中输入数据包括（1）从正类别及其相应标签的观察结果，（（（ 2）来自正面和负面类别的未标记观察结果。生成对抗网络（GAN）已被用来将问题减少到监督环境中，其优势是，监督学习在分类任务中具有最新的精度。为了生成\ textIt {pseudo}阴性观察，甘恩（GAN）接受了正面和未标记的观测值的培训，并修改了损失。同时使用正面和\ textit {pseudo} - 阴性观察会导致监督的学习设置。现实到足以替代缺失的负类样品的伪阴性观察的产生是当前基于GAN的算法的瓶颈。通过在GAN体系结构中加入附加的分类器，我们提供了一种基于GAN的新方法。在我们建议的方法中，GAN歧视器指示发电机仅生成掉入未标记的数据分布中的样品，而第二分类器（观察者）网络将GAN训练监视为：（i）防止生成的样品落入正分布中; （ii）学习正面观察和负面观测之间的关键区别的特征。四个图像数据集的实验表明，我们训练有素的观察者网络在区分实际看不见的正和负样本时的性能优于现有技术。

translated by 谷歌翻译

Learning from imperfect training data using a robust loss function: application to brain image segmentation

Haleh Akrami , Wenhui Cui , Anand A Joshi , Richard M. Leahy

分类：计算机视觉 | 机器学习

2022-08-08

细分是MRI医学图像分析中最重要的任务之一，通常是许多临床应用中的第一步也是最关键的步骤。在大脑MRI分析中，头部分割通常用于测量和可视化大脑的解剖结构，也是其他应用的必要步骤，例如电脑摄影和磁脑摄影（EEG/MEG）中的电流源重建。在这里，我们提出了一个深度学习框架，该框架可以仅使用T1加权MRI作为输入来分割大脑，头骨和颅外组织。此外，我们描述了一种在嘈杂标签的存在下训练模型的强大方法。

translated by 谷歌翻译

CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction

Tara Safavi , Doug Downey , Tom Hope

分类：自然语言处理 | 人工智能 | 机器学习

2022-05-16

知识图（kg）链接预测是人工智能中的一项基本任务，在自然语言处理，信息检索和生物医学中的应用。最近，通过使用结合知识图嵌入（KGE）和上下文语言模型（LMS）的合奏，通过利用KGS中的跨模式信息来实现有希望的结果。但是，现有的合奏要么是（1）在排名准确性提高方面并不始终有效，要么（2）由于与深度语言模型的成对排名的组合爆炸问题，在较大数据集上效率不佳。在本文中，我们提出了一种新型的分层排名架构级联，以保持完全结合的排名准确性，同时大大提高效率。 Cascader使用LMS来重新启动更有效的基本毛金属的输出，依靠自适应子集选择方案，旨在最小化LMS，同时最大程度地利用KGE的精度增益。广泛的实验表明，Cascader在KGE基线上最多可提高9分，从而在四个基准上设定新的最先进的性能，同时在竞争性跨模式基线上提高效率一个或多个数量级。我们的经验分析表明，模型跨模式的多样性和保存单个模型的置信度信号有助于解释级联者的有效性，并提出了跨模式级联体系结构的有希望的方向。可以在https://github.com/tsafavi/cascader上获得代码和预估计的模型。

translated by 谷歌翻译

Privacy-Preserving Federated Learning via System Immersion and Random Matrix Encryption

Haleh Hayati , Carlos Murguia , Nathan van de Wouw

分类：机器学习

2022-04-05

联合学习（FL）已成为协作分布式学习的隐私解决方案，客户直接在其设备上训练AI模型，而不是与集中式（潜在的对手）服务器共享数据。尽管FL在某种程度上保留了本地数据隐私，但已显示有关客户数据的信息仍然可以从模型更新中推断出来。近年来，已经制定了各种隐私计划来解决这种隐私泄漏。但是，它们通常以牺牲模型性能或系统效率为代价提供隐私，而在实施FL计划时，平衡这些权衡是一个至关重要的挑战。在本手稿中，我们提出了一个保护隐私的联合学习（PPFL）框架，该框架建立在控制理论中的矩阵加密和系统沉浸工具的协同作用上。这个想法是将学习算法（随机梯度体面（SGD））浸入更高维度的系统（所谓的目标系统）中，并设计目标系统的动力学，以便：浸入原始SGD的轨迹： /嵌入其轨迹中，并在加密数据上学习（在这里我们使用随机矩阵加密）。矩阵加密是在服务器上重新重新格式化的，作为将原始参数映射到更高维的参数空间的坐标的随机更改，并强制执行目标SGD收敛到原始SGD Optiral解决方案的加密版本。服务器使用浸入式地图的左侧逆汇总模型解密。我们表明，我们的算法提供与标准FL相同的准确性和收敛速度，而计算成本可忽略不计，同时却没有透露有关客户数据的信息。

translated by 谷歌翻译